Based on patent WO 200055765



Abstract (Basic): WO 200055765 A1

NOVELTY - A method for sorting, searching and abstracting documents
in various fields comprises subjecting existing documents to word
separation and analysis; dividing the words into groups of words
having a high degree of relationship, degree of correlation, and
frequency of appearance; extracting a classification representing the
feature of each group; imparting the classification to the words in
each group; and recording the classified words in a word dictionary.
The text of a newly stored document is then subjected to word
separation and analysis using the word dictionary, the classification
is identified by word analysis and imparted to the word or words
extracted, and the document is recorded as knowledge data. Words
belonging to the same group as an inputted or extracted word are
selected, and the stored documents are searched to create an abstract.
USE - None given.
Title Terms: METHOD; SORT; SEARCH; ABSTRACT; DOCUMENT

Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 
International Patent Class (Additional) : G06F-019/00 
File Segment: EPI 
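
The dictionary-and-expansion flow described in the WO 200055765 abstract can be illustrated with a minimal Python sketch. It assumes the grouping step (relationship, correlation, frequency) has already run; `GROUPS`, `build_dictionary`, `classify` and `expand_query` are hypothetical names, and the sample words are invented for illustration rather than taken from the patent.

```python
# Hypothetical word groups, assumed to be the output of the grouping analysis.
GROUPS = {
    "finance": {"bank", "loan", "interest"},
    "medicine": {"drug", "dose", "patient"},
}

def build_dictionary(groups):
    """Map each word to the classification of its group."""
    return {word: label for label, words in groups.items() for word in words}

def classify(text, dictionary):
    """Return the classifications found among the words of a new document."""
    return {dictionary[w] for w in text.lower().split() if w in dictionary}

def expand_query(word, groups, dictionary):
    """Return every word in the same group as the input word."""
    label = dictionary.get(word)
    return set(groups[label]) if label else {word}

dictionary = build_dictionary(GROUPS)
```

A stored document would be tagged with the result of `classify`, and a search on one word would be run over the whole set returned by `expand_query`.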



13/5/16 (Item 11 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2003 Thomson Derwent. All rts. reserv.



012954287 **Image available** 
WPI Acc No: 2000-126137/200011 

Related WPI Acc No: 1999-154021; 2000-627613; 2001-449861 
XRPX Acc No: N00-095089

Database organizing method for internet 
Patent Assignee: CULLISS G.(CULL-I) 
Inventor: CULLISS G 
Patent No   Kind Date     Applicat No  Kind Date     Week
US 6014665  A    20000111 US 97904795  A    19970801 200011 B
                          US 97960140  A    19971029

Priority Applications (No Type Date): US 97960140 A 19971029; US 97904795 A
19970801
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

US 6014665 A 16 G06F-017/30 CIP of application US 97904795 

Abstract (Basic): US 6014665 A 

NOVELTY - An index for storing specific key terms corresponding 
to each term group is set when a primary search query is 
received. The key terms matched with index and other groups are 
stored. Then, the index value is altered relative to matching result. 
When a secondary search query is output, the index is checked for 
key term groupings defined by the users.

DETAILED DESCRIPTION - After the index is checked for key terms, the
key terms that match closely with the other key term groups are set.
When the search query is completed, the index is altered so that the
key term matching score for the key term grouping is altered relative
to the other key term matching scores. The key term
matching scores are divided by their respective key term matching
scores for some of the key term groupings containing the primary key
term to create a key term matching probability score for the
respective key term groupings.

USE - To organize various databases used on the internet.
ADVANTAGE - Narrows down the searched results made available to 
the user, by monitoring searching activity of large number of 
internet users through an evolutionary process. 

DESCRIPTION OF DRAWING(S) - The figure shows the flow diagram of
the operational steps to organize articles used in searching on the
internet.

pp; 16 DwgNo 1/1 
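
One possible reading of the score update and the probability score in the US 6014665 abstract is sketched below in Python. The index layout, the unit increment and the choice of divisor (the total score over all groupings containing the primary key term) are assumptions made for illustration; the abstract does not fix them, and `record_match` and `matching_probabilities` are hypothetical names.

```python
from collections import defaultdict

# index[primary_term][grouping] -> key term matching score (assumed layout)
index = defaultdict(lambda: defaultdict(float))

def record_match(primary_term, grouping, amount=1.0):
    """Raise the score of one grouping relative to the other groupings."""
    index[primary_term][grouping] += amount

def matching_probabilities(primary_term):
    """Divide each grouping's score by the total for the primary key term."""
    scores = index[primary_term]
    total = sum(scores.values())
    return {g: s / total for g, s in scores.items()} if total else {}

# Simulated searching activity of several users (invented data).
record_match("jaguar", ("jaguar", "car"))
record_match("jaguar", ("jaguar", "car"))
record_match("jaguar", ("jaguar", "cat"))
record_match("jaguar", ("jaguar", "car"))
```

With this data the grouping ("jaguar", "car") ends up with three quarters of the probability mass, so a secondary query on "jaguar" would favor it.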
Data recording sheet manufacturing method for data retrieval in library
using computer - by expanding title and registration code of document 
with common keyword through display portion to provide new recording line 
at feed direction of recording sheet 

Patent Assignee: IIDA T (IIDA-I) 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 9069108 A 19970311 JP 9590224 A 19950323 199720 B 

Priority Applications (No Type Date): JP 9590224 A 19950323 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 9069108 A 11 G06F-017/30 

Abstract (Basic) : JP 9069108 A 

The method involves assigning a registration code (1) used in
distinguishing a document from other documents. A document title (2) and
one keyword are extracted. The registration code, document title, and
keyword are provided as one record. A document file is produced by
extracting and recording several such records from several
documents. The keywords are extracted from the document file so that
no keyword is duplicated. The keywords are sequentially
arranged to produce a keyword file. The title and the
registration code for a document are expanded in the line direction of a
recording sheet. The recorded data on the recording sheet is provided
as a display portion (6) corresponding to one keyword of the document.

A new line for the display portion of the document with one
keyword is provided for every document and recorded in the feed
direction of the recording sheet. A keyword block (7) records one
keyword at the head of a display portion group. The keyword block is
recorded on the recording sheet based on the keyword arrangement in
the keyword file.

ADVANTAGE - Enables simple data search from data recording sheet 
without using special apparatus, thus reducing manufacturing cost. 

Dwg.3/8 
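
The sheet layout described in the JP 9069108 abstract (a keyword block at the head of each group, then one line per document) can be sketched in Python as follows. The record contents and the name `layout_sheet` are hypothetical, and the text formatting of each line is an assumption.

```python
from itertools import groupby

# (registration code, title, keyword): one record per keyword of a document
records = [
    ("D002", "Signature files", "retrieval"),
    ("D001", "Thesaurus design", "keyword"),
    ("D001", "Thesaurus design", "retrieval"),
]

def layout_sheet(records):
    """Return sheet lines: a keyword block, then one line per document."""
    lines = []
    ordered = sorted(records, key=lambda r: (r[2], r[0]))  # keyword file order
    for keyword, group in groupby(ordered, key=lambda r: r[2]):
        lines.append(f"[{keyword}]")  # keyword block at the group head
        lines.extend(f"  {code}  {title}" for code, title, _ in group)
    return lines
```

Printing the returned lines in order corresponds to recording them along the feed direction of the sheet.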
Computer system operating method for storing and retrieving data -
creating signature file divided into subsets, mapping word signature 
to particular subset and storing subsets on storage device 

Patent Assignee: BURKOWSKI F J (BURK-I) 

Inventor: KREBS M S 

Number of Countries: 007 Number of Patents: 003 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

EP 304302 A 19890222 EP 88307650 A 19880818 198908 B 

US 4991087 A 19910205 US 88233601 A 19880818 199108 

CA 1291574 C 19911029 199151 

Priority Applications (No Type Date): GB 8719572 A 19870819 
Cited Patents: 3.Jnl.Ref; A3... 9124; No-SR.Pub; US 4183464 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
EP 304302 A E 11 

Designated States (Regional): DE FR GB IT NL 

Abstract (Basic) : EP 304302 A 

The method includes the steps of storing the database on the data
storage device, creating for the database a signature file which is
divided into subsets, mapping a word signature to a particular
subset during creation of the file and storing the signature file
subsets on the data storage device. In response to a query keyword,
the method then scans for a word signature and retrieves the
corresponding data from the database, using the same information that
was used to store the word signature in a particular subset.

During the creation of the signature file for a particular 
document, all common words are ignored, a logical word signature is 
computed for each remaining word and, if logical word signatures are 
computed as hash values, any duplicate logical word signatures are 
eliminated. 

ADVANTAGE - Requires only single probe into signature file. 
2/2 
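
A minimal Python sketch of a partitioned signature file in the spirit of the EP 304302 abstract follows. The hash function, the number of subsets, the modulo mapping rule and the stop-word list are all assumptions chosen for illustration; the abstract only requires that the same information select the subset at storage time and at query time.

```python
import hashlib

N_SUBSETS = 4                      # assumed number of subsets
STOP_WORDS = {"the", "a", "of"}    # assumed list of common words

def signature(word):
    """Logical word signature computed as a hash value."""
    return int.from_bytes(hashlib.sha256(word.encode()).digest()[:4], "big")

def subset_of(sig):
    """Mapping rule shared by storage and query."""
    return sig % N_SUBSETS

def build_signature_file(docs):
    subsets = [dict() for _ in range(N_SUBSETS)]   # signature -> document ids
    for doc_id, text in docs.items():
        # The set drops duplicate signatures within one document.
        sigs = {signature(w) for w in text.lower().split() if w not in STOP_WORDS}
        for sig in sigs:
            subsets[subset_of(sig)].setdefault(sig, set()).add(doc_id)
    return subsets

def lookup(subsets, keyword):
    """Probe only the one subset the keyword's signature maps to."""
    sig = signature(keyword.lower())
    return subsets[subset_of(sig)].get(sig, set())
```

Because `lookup` recomputes the subset from the keyword itself, only one subset is ever read per query keyword.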
searching text for keywords similar to users search request,
Abstract (Basic): WO 8804454 A

The information retrieval method comprises the steps of defining as
a 'criterion key' that keyword which, among all the keywords associated
with any of the texts in the first group of texts, is associated
with the largest number of texts within the first group. The first
group is separated into two sub-groups, the first sub-group of texts
having the criterion key as one of its keywords and the second
sub-group not including the criterion key.

Results obtained from the above steps are then displayed. The
above process is applied recursively to at least one of the two
sub-groups.

ADVANTAGE - Distinguishes between text areas having the same words
but different meanings.
2/16 
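
The criterion-key splitting in the WO 8804454 abstract can be sketched in Python as below. The abstract applies the process recursively to at least one sub-group; this sketch recurses into both, and it excludes keys already used on the path, which is an assumption. The names `criterion_key` and `split` are hypothetical.

```python
from collections import Counter

def criterion_key(texts, used):
    """Keyword attached to the most texts in the group, ignoring used keys."""
    counts = Counter(k for kws in texts.values() for k in kws if k not in used)
    return counts.most_common(1)[0][0] if counts else None

def split(texts, used=frozenset()):
    """Recursively split {text_id: keyword set} on successive criterion keys."""
    key = criterion_key(texts, used)
    if key is None or len(texts) < 2:
        return sorted(texts)                      # leaf: list of text ids
    with_key = {t: k for t, k in texts.items() if key in k}
    without = {t: k for t, k in texts.items() if key not in k}
    used = used | {key}
    return {key: split(with_key, used), "not " + key: split(without, used)}
```

The nested dictionary returned by `split` is the structure that would be displayed to the user at each step.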

Title Terms: COMPUTER; INFORMATION; RETRIEVAL; METHOD; PROCESS; TEXT;
SEARCH; TEXT; KEYWORD; SIMILAR; USER; SEARCH; REQUEST; SELECT; TEXT;
DATA; BASE
Processor for locating representations in stored textual data base -
determines equivalent words for each query and processes 
identifiers in corresponding groups to form score for block 

Patent Assignee: SYSTEM DEV CORP (SYST-N) 

Inventor: DICKINSON R V; GALIE L M

Number of Countries: 009 Number of Patents: 004 

Patent Family: 

Patent No    Kind Date     Applicat No  Kind Date     Week
EP 75903     A    19830406                            198315 B
WO 8301328   A    19830414                            198316
JP 58501525  W    19830908                            198342
US 4495566   A    19850122 US 81307093  A    19810930 198506

Priority Applications (No Type Date) : US 81307093 A 19810930 
Cited Patents: 3.Jnl.Ref; A3... 8516; No-SR.Pub; US 4270182 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
EP 75903 A E 92 

Designated States (Regional): BE DE FR GB IT LU NL 
WO 8301328 A E 

Designated States (National) : JP 

Abstract (Basic) : EP 75903 A 

The apparatus uses a digital data processor and stored representations
of a table of textual block identifiers for locating in a data base
those blocks having the best match with a query. The representations
of textual block identifiers are selectable from the table in groups,
each group corresponding to a different word. Each textual block
identifier provides an indication of a block containing the
corresponding word.

A query word is received having representations of words to be
located. For each query word, a corresponding set of equivalent
words is determined, and representations of the block identifiers in
those groups corresponding to the equivalent words are processed
to form a score for at least one block. The score provides an
indication of the total number of the sets having at least one
equivalent word in the block.
Title Terms: PROCESSOR; LOCATE ; REPRESENT; STORAGE; TEXT ; DATA; BASE; 
DETERMINE; EQUIVALENT; WORD; QUERY ; PROCESS; IDENTIFY; CORRESPOND; 
GROUP; FORM; SCORE; BLOCK 
Derwent Class: T01 

International Patent Class (Additional) : G06F-007/28 ; G06F-013/00 ; 
G06F-015/40 

File Segment: EPI 
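
The block scoring described in the EP 75903 abstract can be illustrated with a short Python sketch. `BLOCK_IDS` and `EQUIVALENTS` are hypothetical tables with invented contents; the score of a block is the number of query-word sets that have at least one equivalent word in that block.

```python
# Hypothetical table: word -> group of identifiers of blocks containing it
BLOCK_IDS = {"car": {1, 2}, "automobile": {3}, "fast": {2}, "quick": {3}}
# Hypothetical sets of equivalent words for each query word
EQUIVALENTS = {"car": {"car", "automobile"}, "fast": {"fast", "quick"}}

def score_blocks(query_words):
    """Score = number of query-word sets with an equivalent word in the block."""
    scores = {}
    for word in query_words:
        blocks = set()
        for equivalent in EQUIVALENTS.get(word, {word}):
            blocks |= BLOCK_IDS.get(equivalent, set())
        for block in blocks:            # each set counts once per block
            scores[block] = scores.get(block, 0) + 1
    return scores
```

For the query words "car" and "fast", blocks 2 and 3 score 2 and block 1 scores 1, so blocks 2 and 3 are the best matches.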
09-128402 [JP 9128402 A]
May 16, 1997 (19970516)
OKA MAMIKO
FUJI XEROX CO LTD [359761] (A Japanese Company or Corporation), JP (Japan)
07-281918 [JP 95281918]
October 30, 1995 (19951030)
[6] G06F-017/30; G06F-017/27
45.4 (INFORMATION PROCESSING -- Computer Applications)



ABSTRACT 

PROBLEM TO BE SOLVED: To provide a document similarity calculation device 
calculating the similarity of document data by reflecting the contents and 
the subject of information in it and a document sorting device sorting 
information based on the similarity. 

SOLUTION: A subject-related expression extraction part 2 reads out a piece 
of document data from a document data storage part 1, morpheme-analyzes the 
text part of document data and extracts a related expression expressing 
plural words and phrases and the relation between them from the result of 
morpheme analysis. Among the extracted related expressions, ones consisting 
of the same group of independent words and provided with a consistent 
relation are collected to count frequency. A related expression expressing 
the subject of the piece of document data is selected in accordance with a 
prescribed reference based on the obtained frequency. The similarity
calculation part 3 calculates the similarity of any two pieces of
document data based on the similarity of the subject-related expressions
extracted by the subject-related expression extraction part 2.
NEC CORP [000423] (A Japanese Company or Corporation), JP (Japan)
01-319199 [JP 89319199]
[5] G06F-015/40
45.4 (INFORMATION PROCESSING -- Computer Applications)
Section: P, Section No. 1270, Vol. 15, No. 430, Pg. 69, 
October 31, 1991 (19911031) 



ABSTRACT 

PURPOSE: To obtain requested data with high probability by substituting and
storing a word representing a word group for data to be registered, and
also substituting a representative word for the data inputted as
retrieval information.
CONSTITUTION: The system is equipped with a standardization part 4 which,
as needed, substitutes (standardizes) the representative word of the word
group having the same meaning for character type data inputted from an
input/output part 1 and delivers it to a data processing part 2,
and a dictionary data base 5 which stores the data to be standardized at
the standardization part 4. When the data to be registered is
standardized, the representative word is stored in a data storage part 3.
Because a retrieval condition is standardized in the same way when it is
inputted, the word that coincides with the representative word is
retrieved, and the condition therefore coincides with all the words that
belong to the same word group. It is thus possible to retrieve the word by
inputting a synonym if the rough meaning is known, even when the user does
not know exactly how the stored data itself is worded.
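
The standardization step in this abstract can be sketched in a few lines of Python. `DICTIONARY` stands in for the dictionary data base 5 and `storage` for the data storage part 3; the entries and the function names are hypothetical.

```python
# Hypothetical dictionary data base: word -> representative word of its group
DICTIONARY = {"automobile": "car", "auto": "car", "car": "car"}

def standardize(word):
    """Replace a word by the representative word of its group, if it has one."""
    word = word.lower()
    return DICTIONARY.get(word, word)

storage = {}   # representative word -> registered records

def register(word, record):
    """Store the record under the standardized form of the word."""
    storage.setdefault(standardize(word), []).append(record)

def retrieve(word):
    """Standardize the retrieval condition the same way, then look it up."""
    return storage.get(standardize(word), [])

register("Automobile", "record 1")
register("car", "record 2")
```

Because both registration and retrieval pass through `standardize`, a query on "auto" finds records registered under "Automobile" and "car".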
Information processor displays relationship of convergence of text
group based on common keyword number
Patent Assignee: KONISHI K (KONI-I)
Number of Countries: 001 Number of Patents: 001
Patent Family:
Patent Details:
Abstract (Basic): JP 2002149671 A

NOVELTY - A processing unit detects the number of keywords in
received information. A memory stores the keywords as a component of
the convergence of a text group with identical keywords. A
detection unit detects the number of common keywords corresponding to
every combination of the convergences. A display unit displays the
relationship of the convergences based on the common keyword number.

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the 
following: 

(1) Information processing method; and 

(2) Recorded medium storing information processing program. 
USE - Information processor. 

ADVANTAGE - Relationship of convergence of text group is 
displayed intelligibly. 

DESCRIPTION OF DRAWING(S) - The figure shows a flowchart
illustrating operation of the information processor. (Drawing includes
non-English language text).
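
The count of common keywords over every combination of text groups, as described in the JP 2002149671 abstract, can be sketched in Python as follows. The group names and keyword sets are invented, and `common_keyword_counts` is a hypothetical name; how the counts are drawn on the display is left open.

```python
from itertools import combinations

# Hypothetical text groups ("convergences"), each with its keyword set
groups = {
    "A": {"search", "index", "keyword"},
    "B": {"keyword", "thesaurus", "search"},
    "C": {"display", "index"},
}

def common_keyword_counts(groups):
    """Count shared keywords for every pair of groups."""
    return {
        (a, b): len(groups[a] & groups[b])
        for a, b in combinations(sorted(groups), 2)
    }
```

A display unit could then draw groups with a higher count closer together or join them with a heavier line.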
Information perusal support system in world wide web - uses keywords
specified by user, closely similar keywords being grouped in same
search space, for information retrieval

Patent Assignee: FUJITSU LTD (FUIT)

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 11219369 A 19990810 JP 9822115 A 19980203 199942 B 
Abstract (Basic): JP 11219369 A

NOVELTY - User specified keywords are used for information
retrieval. Keywords that are closely similar are configured in the
same information search space (1031) and are displayed for selection.

USE - In searching for information using computers in world wide
web.

ADVANTAGE - The whole information is presented based on
relationship between their various modules, thus ensuring their smooth
perusal.

DESCRIPTION OF DRAWING(S) - The figure shows the various
functional blocks of information perusal support system. (1031)
Information search space.
Document searching apparatus for searching document in database - has
keyword expansion unit which performs hierarchical expansion of search 
string after extracting word from related word group according to set 
expansion condition 

Patent Assignee: FUJI XEROX CO LTD (XERF ) 

Number of Countries: 001 Number of Patents: 001 
Patent No Kind Date Applicat No Kind Date Week

JP 9198396 A 19970731 JP 964859 A 19960116 199741 B 

Priority Applications (No Type Date): JP 964859 A 19960116 
Abstract (Basic): JP 9198396 A

The apparatus has a setting unit (110) which sets an expansion 
condition based on the thesaurus expansion of a keyword input by an 
input unit (100). A keyword expansion unit (130) extracts a word from 
related word group according to an extraction range which corresponds 
to the expansion condition for the related word group. 

The related word group has the same hierarchical value as 
that of the related word group in which the keyword in a thesaurus 
dictionary belongs. The word is extracted according to the set 
expansion condition. The keyword expansion unit executes the 
hierarchical expansion of a search string. 

ADVANTAGE - Improves document searching efficiency by reducing
number of candidate documents. Enables hierarchical expansion of
search string without generating fault. Improves document searching
accuracy by reducing number of words which cannot be used by user in
search operation. Prevents reduction in rate of adaptation of
predetermined document and search document during thesaurus expansion
of input keyword.
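
A rough Python sketch of hierarchical thesaurus expansion under an expansion condition follows. It assumes the condition can be reduced to a maximum depth of narrower terms, which is a simplification of the extraction range in the JP 9198396 abstract; `THESAURUS`, `expand` and `search_string` are hypothetical, and the terms are invented.

```python
# Hypothetical thesaurus: term -> narrower terms one level down
THESAURUS = {
    "vehicle": ["car", "truck"],
    "car": ["sedan", "coupe"],
}

def expand(keyword, max_depth):
    """Collect the keyword plus narrower terms down to max_depth levels."""
    terms, frontier = [keyword], [keyword]
    for _ in range(max_depth):
        frontier = [n for t in frontier for n in THESAURUS.get(t, [])]
        terms.extend(frontier)
    return terms

def search_string(keyword, max_depth):
    """Join the expanded terms into an OR search string."""
    return " OR ".join(expand(keyword, max_depth))
```

Tightening `max_depth` keeps the search string short, which is one way to limit the number of candidate documents.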
ABSTRACT

PURPOSE: To prepare the retrieval condition expression based upon the
retrieval intention of an operator.

CONSTITUTION: The retrieval condition expression is inputted and it is
decided whether or not there are plural common high-order words in key
word groups connected in the retrieval condition expression under OR
conditions by tracing a thesaurus file 50; when there are plural high-order
words, the relation between the high-order word and a key word is
displayed; and the operator specifies a high-order word matching the
retrieval intention over a look at the display and adds the specified
high-order word to the retrieval condition expression under AND
conditions (steps 1-6). The thesaurus file 50 is searched for other
low-order word groups belonging to the high-order words in the key
word groups connected under the OR conditions and when those low-order
word groups are not specified as key words in the retrieval
condition expression, the low-order words are displayed; and the operator
specifies the displayed low-order words as key words, which are added as
key words to the key word groups connected in the retrieval
condition expression under the OR conditions (steps 7-10).
ABSTRACT

PURPOSE: To decrease the difference in retrieval results caused by
the user's degree of skill by constituting a knowledge base retrieving
system so that two kinds of retrieval, a regular keyword retrieval
and a link retrieval, can be executed.

CONSTITUTION: At the time of regular keyword retrieval, all keywords in
the same group as the group to which an input keyword belongs in the
thesaurus are drawn out by a link searching means 3.
Subsequently, by a data retrieving means 5, knowledge data having these
keyword groups as a keyword is fetched from a knowledge base for each
separate input keyword. On the other hand, at the time of link retrieval,
first of all, a retrieval expression is generated by the link searching
means 3 by replacing an input keyword with a keyword of a search link
destination determined in accordance with the priority of an inter-group
link, and based on the retrieval expression,
knowledge data is fetched from a knowledge base 4 by the means 5. Also, the
thesaurus can be corrected by a thesaurus constructing means 2.



